341 research outputs found

    SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers

    Full text link
    In recent years, Speech Emotion Recognition (SER) has been investigated mainly transforming the speech signal into spectrograms that are then classified using Convolutional Neural Networks pretrained on generic images and fine tuned with spectrograms. In this paper, we start from the general idea above and develop a new learning solution for SER, which is based on Compact Convolutional Transformers (CCTs) combined with a speaker embedding. With CCTs, the learning power of Vision Transformers (ViT) is combined with a diminished need for large volume of data as made possible by the convolution. This is important in SER, where large corpora of data are usually not available. The speaker embedding allows the network to extract an identity representation of the speaker, which is then integrated by means of a self-attention mechanism with the features that the CCT extracts from the spectrogram. Overall, the solution is capable of operating in real-time showing promising results in a cross-corpus scenario, where training and test datasets are kept separate. Experiments have been performed on several benchmarks in a cross-corpus setting as rarely used in the literature, with results that are comparable or superior to those obtained with state-of-the-art network architectures. Our code is available at https://github.com/JabuMlDev/Speaker-VGG-CCT

    Scaling of the Critical Function for the Standard Map: Some Numerical Results

    Full text link
    The behavior of the critical function for the breakdown of the homotopically non-trivial invariant (KAM) curves for the standard map, as the rotation number tends to a rational number, is investigated using a version of Greene's residue criterion. The results are compared to the analogous ones for the radius of convergence of the Lindstedt series, in which case rigorous theorems have been proved. The conjectured interpolation of the critical function in terms of the Bryuno function is discussed.Comment: 26 pages, 3 figures, 13 table

    A Sparse and Locally Coherent Morphable Face Model for Dense Semantic Correspondence Across Heterogeneous 3D Faces

    Get PDF
    The 3D Morphable Model (3DMM) is a powerful statistical tool for representing 3D face shapes. To build a 3DMM, a training set of face scans in full point-to-point correspondence is required, and its modeling capabilities directly depend on the variability contained in the training data. Thus, to increase the descriptive power of the 3DMM, establishing a dense correspondence across heterogeneous scans with sufficient diversity in terms of identities, ethnicities, or expressions becomes essential. In this manuscript, we present a fully automatic approach that leverages a 3DMM to transfer its dense semantic annotation across raw 3D faces, establishing a dense correspondence between them. We propose a novel formulation to learn a set of sparse deformation components with local support on the face that, together with an original non-rigid deformation algorithm, allow the 3DMM to precisely fit unseen faces and transfer its semantic annotation. We extensively experimented our approach, showing it can effectively generalize to highly diverse samples and accurately establish a dense correspondence even in presence of complex facial expressions. The accuracy of the dense registration is demonstrated by building a heterogeneous, large-scale 3DMM from more than 9,000 fully registered scans obtained by joining three large datasets together

    Macro-and Micro-Expressions Facial Datasets: A Survey

    Get PDF
    Automatic facial expression recognition is essential for many potential applications. Thus, having a clear overview on existing datasets that have been investigated within the framework of face expression recognition is of paramount importance in designing and evaluating effective solutions, notably for neural networks-based training. In this survey, we provide a review of more than eighty facial expression datasets, while taking into account both macro-and micro-expressions. The proposed study is mostly focused on spontaneous and in-the-wild datasets, given the common trend in the research is that of considering contexts where expressions are shown in a spontaneous way and in a real context. We have also provided instances of potential applications of the investigated datasets, while putting into evidence their pros and cons. The proposed survey can help researchers to have a better understanding of the characteristics of the existing datasets, thus facilitating the choice of the data that best suits the particular context of their application

    Scaling law in the Standard Map critical function. Interpolating hamiltonian and frequency map analysis

    Get PDF
    We study the behaviour of the Standard map critical function in a neighbourhood of a fixed resonance, that is the scaling law at the fixed resonance. We prove that for the fundamental resonance the scaling law is linear. We show numerical evidence that for the other resonances p/qp/q, q≥2q \geq 2, p≠0p \neq 0 and pp and qq relatively prime, the scaling law follows a power--law with exponent 1/q1/q.Comment: AMS-LaTeX2e, 29 pages with 8 figures, submitted to Nonlinearit

    Aluminium electrodeposition from ionic liquid: Effect of deposition temperature and sonication

    Get PDF
    Since their discovery, ionic liquids (ILs) have attracted a wide interest for their potential use as a medium for many chemical processes, in particular electrochemistry. As electrochemical media they allow the electrodeposition of elements that are impossible to reduce in aqueous media. We have investigated the electrodeposition of aluminium from 1-butyl-3-methyl-imidazolium chloride ((Bmim)Cl)/AlCl3 (40/60 mol %) as concerns the effect of deposition parameters on the quality of the deposits. Thick (20 μm) aluminium coatings were electrodeposited on brass substrates at different temperatures and mixing conditions (mechanical stirring and sonication). These coatings were investigated by means of scanning electron microscope, roughness measurements, and X-ray diffraction to assess the morphology and the phase composition. Finally, electrochemical corrosion tests were carried out with the intent to correlate the deposition parameters to the anti-corrosion properties

    Chaotic systems in complex phase space

    Full text link
    This paper examines numerically the complex classical trajectories of the kicked rotor and the double pendulum. Both of these systems exhibit a transition to chaos, and this feature is studied in complex phase space. Additionally, it is shown that the short-time and long-time behaviors of these two PT-symmetric dynamical models in complex phase space exhibit strong qualitative similarities.Comment: 22 page, 16 figure
    • …
    corecore